Using Heuristics Based Approach for Segmentation and Recognition of Printed Arabic Characters

نویسنده

  • Ihab Zaqout
چکیده

In this study, we propose a flexible template-matching algorithm for word segmentation, and structural analysis of features extraction is used for character recognition in the printed Arabic text. The input text image is preprocessed by the binarization and then by morphological operations. A vector quantization of the thinned image (VQTM) is created based on the idea of a freeman chain code tracking method. In the segmentation process, 113 character templates are compared for partially/completely existence in the VQTM. A nonlinear filter is applied on the segmented regions to extract the termination and bifurcation features. The spatial distribution of the extracted features and other statistical characteristics are analyzed for the verification of recognition. Experimental results show that the overall recognition rate of the three fonts: Arabic transparent, simplified Arabic and traditional Arabic is 98.63%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Printed Arabic optical character segmentation

A considerable progress in recognition techniques for many non-Arabic characters has been achieved. In contrary, few efforts have been put on the research of Arabic characters. In any Optical Character Recognition (OCR) system the segmentation step is usually the essential stage in which an extensive portion of processing is devoted and a considerable share of recognition errors is attributed. ...

متن کامل

Arabic Character Segmentation Using Projection Based Approach with Profile's Amplitude Filter

Arabic is one of the languages th challenges to Optical character recognition ( challenge in Arabic is that it is mostly curs segmentation process must be carried out character’s start and end. This step is essen recognition. This paper presents Ar segmentation algorithm. The proposed alg projection-based approach concepts to separ and characters. This is done using profile's and simple edge to...

متن کامل

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

Neural Network Based Segmentation Algorithm for Arabic Characters Recognition

This paper presents a novel holistic technique for classifying Arabic handwritten text documents, which it is performed in several steps. First, the Arabic handwritten document images are segmented into their connected parts. A simple heuristic segmentation algorithm is used which finds segmentation points in printed and cursive handwritten words. Second, several features are extracted from the...

متن کامل

An Adaptive Algorithm for the Automatic Segmentation of Printed Arabic Text

Character segmentation is a crucial step in most Arabic optical text recognition systems. The recognition process depends mainly on the accuracy of the character segmentation. This paper presents a novel adaptive algorithm for the off-line segmentation of printed Arabic text. There are many challenging features in the Arabic writing, for example, it is cursive and characters in a word can take ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012